Picture for Xi Wang

Xi Wang

MV-Actor: Aligning Multi-View Semantics and Spatial Awareness for Bimanual Manipulation

Add code
Jun 09, 2026
Viaarxiv icon

BA-T: An Iterative Transformer for Two-View Bundle Adjustment

Add code
Jun 02, 2026
Viaarxiv icon

Spatial Transcriptomics-Guided Alignment Enhances Molecular Profiling in Pathology Foundation Model

Add code
May 29, 2026
Viaarxiv icon

Cross-Modal Clinical Knowledge Integration for Mammography Report Generation

Add code
May 29, 2026
Viaarxiv icon

The 2nd EReL@MIR Workshop on Efficient Representation Learning for Multimodal Information Retrieval

Add code
May 26, 2026
Viaarxiv icon

CapVector: Learning Transferable Capability Vectors in Parametric Space for Vision-Language-Action Models

Add code
May 11, 2026
Viaarxiv icon

RefEvo: Agentic Design with Co-Evolutionary Verification for Agile Reference Model Generation

Add code
Apr 27, 2026
Viaarxiv icon

Bringing a Personal Point of View: Evaluating Dynamic 3D Gaussian Splatting for Egocentric Scene Reconstruction

Add code
Apr 26, 2026
Viaarxiv icon

TTS-PRISM: A Perceptual Reasoning and Interpretable Speech Model for Fine-Grained Diagnosis

Add code
Apr 24, 2026
Viaarxiv icon

Test-Time Perturbation Learning with Delayed Feedback for Vision-Language-Action Models

Add code
Apr 20, 2026
Viaarxiv icon